首页> 外文OA文献 >Deep Karaoke: Extracting Vocals from Musical Mixtures Using a Convolutional Deep Neural Network

【2h】

Deep Karaoke: Extracting Vocals from Musical Mixtures Using a Convolutional Deep Neural Network

机译：深度卡拉OK：使用卷积深度神经网络从混合音乐中提取人声

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Identification and extraction of singing voice from within musical mixtures is a key challenge in source separation and machine audition. Recently, deep neural networks (DNN) have been used to estimate 'ideal' binary masks for carefully controlled cocktail party speech separation problems. However, it is not yet known whether these methods are capable of generalizing to the discrimination of voice and non-voice in the context of musical mixtures. Here, we trained a convolutional DNN (of around a billion parameters) to provide probabilistic estimates of the ideal binary mask for separation of vocal sounds from real-world musical mixtures. We contrast our DNN results with more traditional linear methods. Our approach may be useful for automatic removal of vocal sounds from musical mixtures for 'karaoke' type applications.

机译：从音乐混音中识别和提取歌声是音源分离和机器试听的关键挑战。最近，深度神经网络（DNN）已用于估计“理想”二进制掩码，以解决鸡尾酒派对语音分离问题。但是，尚不知道这些方法是否能够推广到混音环境中对语音和非语音的区分。在这里，我们训练了卷积DNN（约有十亿个参数），以提供将理想声音从真实世界音乐混音中分离出来的理想二进制掩码的概率估计。我们将DNN结果与更传统的线性方法进行对比。对于“卡拉OK”类型的应用，我们的方法对于从音乐混合中自动删除声音可能很有用。

著录项

作者
Simpson, AR; Roma, G; Plumbley, M;
展开▼
作者单位

展开▼
年度 2015
总页数
原文格式 PDF
正文语种 en
中图分类
入库时间 2022-08-20 20:30:01

相似文献

外文文献
中文文献
专利

1. Mixtures of Lightweight Deep Convolutional Neural Networks: Applied to Agricultural Robotics [J] . Chris McCool, Tristan Perez, Ben Upcroft IEEE Robotics and Automation Letters . 2017,第3期

机译：轻型深度卷积神经网络的混合物：应用于农业机器人
2. Application of Deep Convolutional Neural Networks in Attention-Deficit/Hyperactivity Disorder Classification: Data Augmentation and Convolutional Neural Network Transfer Learning [J] . Zhu Li, Chang Weike Journal of Medical Imaging and Health Informatics . 2019,第8期

机译：深度卷积神经网络在注意力缺陷/多动障碍分类中的应用：数据增强与卷积神经网络转移学习
3. An artificial target detection method combining a polarimetric feature extractor with deep convolutional neural networks [J] . Sun Rui, Sun Xiaobing, Chen Feinan, International journal of remote sensing . 2020,第13a14期

机译：具有深度卷积神经网络的偏振特征提取器的人工目标检测方法
4. Deep Karaoke: Extracting Vocals from Musical Mixtures Using a Convolutional Deep Neural Network [C] . Andrew J.R. Simpson, Gerard Roma, Mark D. Plumbley International conference on latent variable analysis and signal separation . 2016

机译：深度卡拉OK：使用卷积深度神经网络从音乐混合中提取人声
5. Deep Neural Language Model for Text Classification Based on Convolutional and Recurrent Neural Networks [D] . Hassan, Abdalraouf. 2018

机译：基于卷积神经网络和递归神经网络的深度神经语言文本分类模型
6. Deep neural networks show an equivalent and often superior performance to dermatologists in onychomycosis diagnosis: Automatic construction of onychomycosis datasets by region-based convolutional deep neural network [O] . Seung Seog Han, Gyeong Hun Park, Woohyung Lim, -1

机译：深度神经网络在灰指甲诊断中显示出与皮肤科医生相当且通常优于皮肤病的性能：通过基于区域的卷积深度神经网络自动构建灰指甲数据集
7. Deep Karaoke: Extracting Vocals from Musical Mixtures Using a Convolutional Deep Neural Network [O] . Simpson, Andrew J. R., Roma, Gerard, Plumbley, Mark D. 2015

机译：深卡拉OK：使用音乐从音乐混合物中提取人声卷积深度神经网络

Deep Karaoke: Extracting Vocals from Musical Mixtures Using a Convolutional Deep Neural Network

摘要

著录项

相似文献

相关主题

期刊订阅